Abstract:
In-memory computing is emerging as a promising paradigm in commodity servers to accelerate data-intensive processing by striving to keep the entire dataset in DRAM. To address the tremendous pressure on the main memory system, discrete memory modules can be networked together to form a memory pool, enabled by recent trends towards richer memory interfaces (e.g., Hybrid Memory Cubes, or HMCs). Such an inter-memory network provides a scalable fabric to expand memory capacity, but still suffers from long multi-hop latency, limited bandwidth, and high power consumption, problems that will only worsen as the gap between interconnect and transistor performance grows. Moreover, inside each memory module, an intra-memory network (NoC) is typically employed to connect the different memory partitions. Without careful design, back-pressure inside the memory modules can propagate to the inter-memory network and become a performance bottleneck. To address these problems, we propose co-optimization of the intra- and inter-memory networks. First, we re-organize the intra-memory network structure and provide a smart I/O interface that reuses the intra-memory NoC as the switching fabric for inter-memory communication, thus forming a unified memory network. Building on this architecture, we further optimize the inter-memory network for both higher performance and lower energy, including a distance-aware selective compression scheme that drastically reduces the communication burden, and a light-weight power-gating algorithm that turns off under-utilized links while guaranteeing a connected graph and deadlock-free routing. We develop an event-driven simulator to model the proposed architectures. Experimental results based on both synthetic traffic and real big-data workloads show that our unified memory network architecture achieves a 75.1% average memory access latency reduction and 22.1% total memory energy savings.
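To make the distance-aware selective compression idea concrete, here is a minimal sketch in Python. It assumes a 2D-mesh inter-memory network with dimension-order routing and a fixed hop-count threshold; the threshold value, the packet layout, and the use of zlib as the compressor are illustrative assumptions, since the abstract does not specify these details.

```python
# A minimal sketch of distance-aware selective compression, assuming a
# 2D-mesh inter-memory network and a hop-count threshold. The threshold,
# packet layout, and choice of zlib are illustrative assumptions.
import zlib
from dataclasses import dataclass

HOP_THRESHOLD = 3  # assumed: compress only packets traveling >= 3 hops

@dataclass
class Packet:
    src: tuple[int, int]   # (x, y) coordinate of the source cube
    dst: tuple[int, int]   # (x, y) coordinate of the destination cube
    payload: bytes
    compressed: bool = False

def hops(src, dst):
    # Manhattan distance = hop count under dimension-order mesh routing.
    return abs(src[0] - dst[0]) + abs(src[1] - dst[1])

def maybe_compress(pkt: Packet) -> Packet:
    # Compression costs (de)compression latency at the endpoints but saves
    # bandwidth and link energy on every hop, so it only pays off when the
    # packet travels far enough to amortize the endpoint overhead.
    if hops(pkt.src, pkt.dst) >= HOP_THRESHOLD:
        pkt.payload = zlib.compress(pkt.payload)
        pkt.compressed = True
    return pkt
```

Compressing only long-haul packets keeps the endpoint (de)compression latency off the critical path of near-neighbor accesses; the link power-gating algorithm is a separate mechanism that the abstract constrains only to preserve connectivity and deadlock-free routing.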
Abstract:
High-performance computing, enterprise, and datacenter servers are driving demands for higher total memory capacity as well as memory performance. Memory “cubes” with high per-package capacity (from 3D integration), along with high-speed point-to-point interconnects, provide a scalable memory system architecture with the potential to deliver both capacity and performance. Multiple such cubes connected together can form a “Memory Network” (MN), but the design space for such MNs is quite vast, including multiple topology types and multiple memory technologies per memory cube. In this work, we first analyze several MN topologies with different mixes of memory package technologies to understand the key tradeoffs and bottlenecks for such systems. We find that most of an MN's performance challenges arise from the interconnection network that binds the memory cubes together. In particular, the arbitration schemes used to route through MNs, the ratio of NVM to DRAM, and the specific topologies used have a dramatic impact on performance and energy results. Our initial analysis indicates that introducing non-volatile memory to the MN presents a unique tradeoff between memory array latency and network latency. We observe that placing NVM cubes in a specific order in the MN improves performance by reducing the network size/diameter, up to a certain NVM-to-DRAM ratio. Novel MN topologies and arbitration schemes also provide performance and energy gains by reducing the hop count of requests and responses in the MN. Based on our analyses, we introduce three techniques to address MN latency issues: (1) a distance-based arbitration scheme that reduces queuing latencies throughout the network, (2) a skip-list topology, derived from the classic data structure, that reduces network latency and improves link usage, and (3) the MetaCube, a denser memory cube that leverages advanced packaging technologies to reduce latency by shrinking the MN.
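As a concrete illustration of technique (1), the sketch below implements one plausible reading of distance-based arbitration: at each router output, packets that have already traveled more hops win the link, which evens out queuing delay between near and far cubes. The hops-traveled metric, the heap-based queue, and the FIFO tie-break are assumptions; the paper's arbiter may define distance differently (e.g., hops remaining).

```python
# A sketch of a distance-based arbiter for one router output port. Packets
# are ordered by hops already traveled (farther-traveled first), with a
# FIFO tie-break so equal-distance packets keep arrival order. These
# details are illustrative assumptions, not the paper's exact mechanism.
import heapq
import itertools

class DistanceArbiter:
    def __init__(self):
        self._queue = []
        self._arrival = itertools.count()  # monotonic tie-break counter

    def enqueue(self, packet, hops_traveled):
        # Negate hops so the max-distance packet sits at the heap top.
        heapq.heappush(self._queue, (-hops_traveled, next(self._arrival), packet))

    def grant(self):
        """Return the next packet allowed to traverse the link, or None."""
        if not self._queue:
            return None
        return heapq.heappop(self._queue)[2]

# Example: a request 5 hops from home wins over a 1-hop local request.
arb = DistanceArbiter()
arb.enqueue("local-read", hops_traveled=1)
arb.enqueue("remote-read", hops_traveled=5)
assert arb.grant() == "remote-read"
```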
Abstract:
Interposer-based packaging is becoming a widespread methodology for tightly integrating multiple heterogeneous dies into a single package, with the potential to improve manufacturing yield and build larger-than-reticle-sized systems. However, interposer integration also introduces possible communication bottlenecks and cost overheads that can outweigh these benefits. To avoid these drawbacks, the abundant interposer interconnect can be leveraged as a network-on-chip (NoC) interconnection fabric that provides high-bandwidth, low-latency communication between chiplets and memory stacks. This work investigates this new interposer design space of passive and active interposer technologies, NoC topologies, and clocking schemes to determine the cost-optimal interposer architectures for a range of performance requirements.
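The kind of exploration this abstract describes can be pictured as a sweep over design points with a cost model attached. The sketch below is purely illustrative: the option lists and all cost/performance numbers are made-up placeholders standing in for the paper's real models, which a study like this would derive from circuit- and cycle-level evaluation.

```python
# Illustrative design-space sweep: enumerate (interposer, topology, clocking)
# points, filter to those meeting a performance target, and return the
# cheapest. Every number below is a placeholder, not data from the paper.
from itertools import product

INTERPOSERS = ["passive", "active"]
TOPOLOGIES = ["mesh", "torus", "crossbar"]              # assumed candidate set
CLOCKING = ["fully-synchronous", "source-synchronous"]  # assumed candidate set

def evaluate(interposer, topology, clocking):
    """Return (relative_cost, relative_performance) for one design point."""
    cost = {"passive": 1.0, "active": 1.8}[interposer]
    perf = {"mesh": 1.0, "torus": 1.3, "crossbar": 1.6}[topology]
    if interposer == "active":
        perf *= 1.4   # active interposers can host routers in the interposer
    if clocking == "source-synchronous":
        cost *= 1.05  # extra clock-forwarding wires
        perf *= 1.1
    return cost, perf

def cost_optimal(perf_target):
    feasible = []
    for point in product(INTERPOSERS, TOPOLOGIES, CLOCKING):
        cost, perf = evaluate(*point)
        if perf >= perf_target:
            feasible.append((cost, point))
    return min(feasible, default=None)  # cheapest point meeting the target

print(cost_optimal(perf_target=1.5))
```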
Abstract:
Three-dimensional (3D) integration is considered a solution to overcome the capacity, bandwidth, and performance limitations of memories. However, due to thermal challenges and cost issues, industry has embraced 2.5D implementations for integrating die-stacked memories with large-scale designs, enabled by silicon interposer technology that integrates processors and multiple modules of 3D-stacked memory in the same package. Previous work has adopted Network-on-Chip (NoC) concepts for the communication fabric of 3D designs, but the design of a scalable processor-memory interconnect for 2.5D integration remains elusive. In this work, we first explore different network topologies for integrating CPUs and memories in a silicon interposer-based multi-core system and reveal that simple point-to-point connections cannot reach the full potential of the memory performance due to bandwidth limitations, especially as more and more memory modules are needed to support emerging applications with high memory capacity and bandwidth demands, such as in-memory computing. To overcome this scaling problem, we propose a memory network design that directly connects all the memory modules, utilizing the existing routing resources of silicon interposers in 2.5D designs. Observing the unique network traffic in our design, we present a design space exploration that evaluates network topologies and routing algorithms, taking process node and interposer technology decisions into account. We implement an event-driven simulator to evaluate our proposed memory network in silicon interposer (MemNiSI) design with synthetic traffic as well as real in-memory computing workloads. Our experimental results show that, compared to baseline designs, the MemNiSI topology reduces average packet latency by up to 15.3%, and the Choose Fastest Path (CFP) algorithm reduces it by up to a further 8.0%. Our scheme effectively exploits the potential of integrated stacked memory while providing better scalability and infrastructure for large-scale silicon interposer-based 2.5D designs.
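The abstract names Choose Fastest Path (CFP) without detailing it; a natural reading is an adaptive routing choice in which each hop picks, among the output ports lying on a shortest path to the destination, the one with the lowest estimated delay. The sketch below encodes that reading using queue occupancy as a congestion proxy; the delay estimate and port model are assumptions, not the paper's definition.

```python
# A sketch of a CFP-style port selection at one router, assuming the
# "fastest" path is estimated from per-port queue occupancy plus the
# static latency of the link behind each port. Both inputs are assumed
# congestion proxies; the paper's CFP metric may differ.
def choose_fastest_port(candidate_ports, queue_depth, link_latency):
    """candidate_ports: ports lying on a minimal path to the destination.
    queue_depth[p]:  flits currently buffered at port p.
    link_latency[p]: fixed traversal cycles of the link behind port p."""
    return min(candidate_ports, key=lambda p: queue_depth[p] + link_latency[p])

# Example: port "east" is minimal-distance but congested, so "north" wins.
ports = ["east", "north"]
print(choose_fastest_port(ports,
                          queue_depth={"east": 7, "north": 2},
                          link_latency={"east": 1, "north": 2}))  # -> north
```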
Abstract:
Domain-specific accelerators for graph analytics leverage a large on-chip memory to tackle intensive random memory accesses, offering higher performance and energy efficiency than conventional architectures. However, limited by inefficient usage of on-chip memory, current accelerators suffer from energy and performance bottlenecks due to the large number of off-chip memory accesses. In this work, we introduce an online preprocessing step for the vertex-centric programming model, based on our observation of imbalanced memory bandwidth utilization between the two execution phases. Our scheme improves energy efficiency and performance by significantly reducing off-chip accesses in two ways. First, we sequence random off-chip memory accesses to balance memory bandwidth demands and improve the utilization of on-chip memory. Second, we prune active leaf vertices to avoid redundant memory accesses. We evaluate our method on a state-of-the-art graph analytics accelerator and achieve a 1.6× speedup while reducing energy consumption by 42% on average.
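To illustrate the second idea, here is a minimal sketch of active-leaf pruning inside a push-style vertex-centric scatter loop. It assumes "leaf" means a vertex with at most one outgoing edge, whose scatter can only revisit the neighbor that activated it, and it assumes a CSR edge layout; both are illustrative readings of the abstract, not the accelerator's actual dataflow.

```python
# Sketch of active-leaf pruning in a push-style scatter phase over a CSR
# graph. Leaves (out-degree <= 1) are dropped from the frontier before
# their edge lists are fetched, saving the off-chip accesses that could
# not have activated any new vertex. The leaf definition and CSR layout
# are assumptions made for illustration.
def prune_leaves(frontier, out_degree):
    return [v for v in frontier if out_degree[v] > 1]

def scatter(frontier, row_ptr, col_idx, out_degree, push_update):
    """push_update(src, dst) applies src's update to dst and returns True
    if dst's value changed (i.e., dst becomes active next iteration)."""
    next_frontier = set()
    for v in prune_leaves(frontier, out_degree):
        for e in range(row_ptr[v], row_ptr[v + 1]):  # v's CSR edge list
            u = col_idx[e]
            if push_update(v, u):
                next_frontier.add(u)
    return next_frontier
```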